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Inventors: Moises Goldszmidt, Bikash Sabata 
Derek Palma, Amitava Raha 



METHODS AND APPARATUS TO CONTROL DELIVERY 
OF A HOST INFORMATION SYSTEMS 
SERVICE LEVELS AND RESPONSE TIMES 

BACKGROUND OF THE INVENTION 



Cross-Reference to Related Applications 

This application is a continuation-in-part of U.S. Ser. Nos. 

, , , , , , and 



1 0 all filed on the same date as this application, and further identified as 
Attorney Docket Nos. 22436-701, 22436-702, 22436-703, 22436-705, 
22436-706, 22436-707, 22436-708 and 22436-709, which applications are 
incorporated herein by reference. 

15 Field of the Invention 

This invention relates generally to methods and apparatus directed to 
dynamically control of an information system, and more particularly to a 
method and apparatus that delivers consistent and predictable service 
quality for multiple requests in an information system. 

20 

Description of Related Art 

Several protocols exist in which one computer (a "host") receives 
and processes requests from a number of other computers ("clients"). For 
example, in applications involving the world-wide web, a server can receive 
25 and process many concurrent requests from different users on the internet, 
in this example, the server would be the host while each user device would 
be a client. 
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Requests can usually be grouped into sessions, with each session 
each having one or more related requests. For example, a multiple-request 
session could consist of a request requesting information over the world- 
wide web, and an associated response. Alternatively, a multiple-request 
5 session could consist of a commercial transaction, with related requests 
respectively used to locate a web site for a precise product, submit an order 
or billing and shipping information, and convey a confirmation of sale to a 
particular client. Whether a host is to process just a single request or a 
series of related requests, is usually important to quickly, accurately and 

10 completely service each request and each session. 

The term "quality of service" refers a host's ability to, provide quick 
and consistent responses to a request, complete a session and consistency in 
doing so. As a particular host becomes more popular, and due to that 
popularity receives more requests, the host's processing resources can 

15 become stretched. For example, due to heavy traffic, a host may not be able 
to respond to a request at all, or the host may not provide a timely response 
(which can cause a client to "time-out" and generate an error). Poor quality 
of service can have significant consequences, as users may become 
frustrated and simply give up trying to reach a particular host, or the sponsor 

20 of the host may lose sales or fail to communicate needed information to any 
or all clients. 

Two techniques are generally used to alleviate quality of service 
problems. First, more processing capacity can be added to the host, 
typically by either replacing the host with another, more powerful computer, 
25 or by providing multiple computers in parallel and delegating new requests 
to different ones of the multiple computers. While this first technique 
presents an effective way of reducing some quality of service problems, it is 
not always practical. For example, sometimes, due to inadequate planning, 
budgetary constraints or space constraints, additional processing capacity 
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cannot be added. If demand for a host is not properly forecast, there may be 
a long lead time before additional processing capacity can be purchased and 
implemented. Additionally, the processing power may be in the placed 
inefficiently in the information system. 
5 A second technique calls for applying "admission control," where 

only a certain number of client requests are processed ("admitted") and the 
remainder are refused; of the requests which are in fact admitted, all are 
ideally handled in an expedient manner without degradation of quality of 
service as to those admitted requests. An advantage of this technique is that 

10 admission control can be implemented in software, thus facilitating quick, 
inexpensive use with little advance notice. Unfortunately, typical admission 
control mechanisms operate by admitting requests on a request-by-request 
basis, and so, these typical admission control requests do not provide an 
adequate solution for multiple-request sessions. Also, the requests which 

15 are not admitted to the host are generally not handled at all, such that a 
client is not informed that the request has been refused or the client, if 
informed, is simply asked to "try again later. " Typically, a refused client 
must try repeatedly to obtain service with no guarantee that future requests 
will be processed. For these reasons and others, techniques generally used 

20 to alleviate quality of service problems are not always successful. U.S. 
Patent No. 6,006,269, incorporated herein by reference, discloses an 
admission control system having an admission control gateway, a deferral 
manager and a scheduler. When the admission control gateway receives a 
request that calls for a new client session, the gateway determines whether a 

25 processing threshold has been reached; if the threshold has been reached or 
surpassed, the request is passed to the deferral manager to formulate a 
response to the particular client. The scheduler is checked to determine a 
time when the host can expect to have processing resources available, and 
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the deferral manager then formulates a time indication which tells the client 
when the client can expect to gain admission to the host. 

A need exists for an admission control system having an improved 
ability to alleviate quality of service problems. In particular, a need exists 
5 for an admission control system which responds to all requests, whether or 
not those requests are actually admitted. Ideally, such system would operate 
by admitting entire sessions, not just individual requests, such that requests 
relating to a session in-progress are generally admitted. With a system of 
this type, admission control would at least provide a reliable means of 
10 finishing each session with high quality of service. Finally, a need exists for 
a system that provides some committed level of service to all clients, 
including those which may have been initially refused admission. 



SUMMARY OF THE INVENTION 

15 Accordingly, an object of the present invention is to provide a 

control method and apparatus with an improved ability to alleviate quality 
of service problems. 

Another object of the present invention is to provide a control 
method and apparatus that responds to all requests whether or not those 
20 requests are actually admitted. 

Yet another object of the present invention is to provide a control 
method and apparatus that provides some level of service to all clients, 
including those which have been refused admission. 

An object of the present invention is to provide a method and 
25 apparatus that enables e-businesses to deliver predictable and consistent 
service levels when there are sudden and unpredicted changes in traffic and 
infrastructure. 

Still another object of the present invention is to provide a method 
and apparatus that enables e-businesses to deliver multiple differentiated 
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service levels that are predictable and consistent when there are changes in 
traffic and infrastructure. 

Yet another object of the present invention is to provide a method 
and apparatus that enables e-businesses to create different customer classes 
5 and service levels and serve them in terms of priority when there are 
changes in traffic and infrastructure. 

A further object of the present invention is to provide a method and 
apparatus that minimizes server and site meltdown under high level traffic 
conditions. 

10 Yet a further object of the present invention is to provide a method 

and apparatus that proactively and precisely plans and provisions web site 
infrastructures for future growth and for under certain conditions. 

These and other objects of the present invention are achieved in a 
predictive method for admitting a content request to an information system. 

15 A user's quality of service objectives are received at the information system. 
A content request is received that is to be admitted to the information 
system. A determination is made to see if the content request is for an 
existing session or a new session. The content request is then sent to a 
dispatch control function at the information system when the content request 

20 is for an existing session. 

In another embodiment of the present invention, a reactive method is 
provided for admitting a content request to an information system with a 
server. A user's quality of service objectives are received at the information 
system. A content request to be admitted to the information system is 

25 received. A quota is calculated of maximum sessions that the server can 
handle and maintain the user's quality of service objectives. A 
determination is then made to see if the content request exceeds the quota. 

In another embodiment of the present invention, a method is 
provided for admitting a content request to an information system, 
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comprising. A user's quality of service objectives are received at the 
information system. A content request to be admitted to the information 
system is received from a requestor. A determination is made to see if the 
content request is for an existing session or a new session. If the content 
5 request is not part of an existing session then a prediction is made of future 
content requests expected in a predetermined time for the information 
system. 

BRIEF DESCRIPTION OF THE FIGURES 

10 Figure 1 is a flow chart illustrates characterization of an information 

system according to one embodiment of the present invention. 

Figure 2 is a flow chart illustrating a process for characterizing 

software applications by behaviors and content requests according to one 

embodiment of the present invention.. 
1 5 Figure 3 is a flow chart illustrating a process for generating capacity 

and provisioning functions according to one embodiment of the present 

invention.. 

Figure 4(a) is a flow chart illustrating an admission control 
methodology for admitting new content requests to an information system 
20 according to one embodiment of the present invention. 

Figure 4(b) is a flow chart illustrating a predictive admission control 
methodology for admitting new content requests to an information system 
according to one embodiment of the present invention. 

Figure 4(c) is a flow chart illustrating a reactive methodology for 
25 admitting new content requests to an information system according to one 
embodiment of the present invention. 

Figure 4(d) is a flow chart illustrating a process for determining 
which of a predictive or reactive methodology for admitting new content 
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requests to an information system according to one embodiment of the 
present invention. 

Figures 5(a) and 5(b) are flow charts that illustrate process for 
dispatching content requests to a capacity function and a server according to 
5 one embodiment of the present invention.. 

Figure 6 is a flow chart illustrating a process for provisioning an 
information system according to one embodiment of the present invention. 

Figure 7 is a flow chart illustrating a process for dynamically 
controlling components of an information system according to one 
10 embodiment of the present invention. 

DETAILED DESCRIPTION 

The present invention is a method and system that enables, (i) 
consistent and predictable delivery of service levels to content requests in 

1 5 the midst of changes to content request levels and mixes and information 
system infrastructure behavior based on defined business objectives, (ii) 
consistent and predictable delivery of multiple, differentiated service levels 
to different customer classes and service requests in the midst of changes to 
request levels and mixes and information system infrastructure behavior 

20 based on defined business objectives, (hi) absolute prevention of server or 
information system overload ("meltdown") in the midst of extreme and 
unanticipated changes in either traffic or information system infrastructure 
coupled with business defined alternatives on how to best deliver services to 
customer classes and services during these changes and (iv) the ability to 

25 proactively and precisely provision and plan for information systems 
infrastructure growth and service level delivery from the information 
systems which takes into consideration potential changes in content request 
levels, content request mixes, changes to the information system and 
changes to service levels offered. 
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These different customer classes include but not are not limited to, 
most profitable customers, most frequent customers, customers who are 
currently on the information system as compared to those who are about to 
enter the information system, first time customers, customers specifying a 
5 specific content request such as book buying verses book browsing, and the 
like. Examples of different service levels include but are not limited to, 
guaranteed fastest access to the information system, guaranteed response 
times from the information system, guaranteed transaction protection during 
heavy traffic conditions, guaranteed information system access during all 
10 conditions, and the like. Priority of serviced is determined by the manager 
of the information system in line with its business objectives. It will be 
appreciated that the present invention is not limited to the preceding. 

A host processing system can be utilized to provide priority access 
based on multiple classes of service and deferral of certain content requests. 
15 A web page can be downloaded to a client having automatic or elective 
attempts to later attain access. The host processing system allocates 
incoming content requests to one or more processing tasks according to 
priority or class of service (each referred to as "priority"). Priority can be 
associated with each content request by, an admission control and 
20 dispatching system , information contained within a content request, a client 
system, the server or other components of the information system, and the 
like. 

In one embodiment, priority can be assigned to content requests 
upon deferral because the server is too busy to handle new sessions 
25 represented by the content requests such that when later re-submitted, the 
content requests are then-handled on a priority basis. In one example, 
deferred content requests can be specifically assigned an appointment for re- 
submission at a time when it is thought that the server can guarantee priority 
processing to the content request. Additionally, the information system can, 
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send a message to the requestor, be unavailable for the content request, be 
unavailable for the content request for a selected period of time, queue the 
content request for admission to the information system, gracefully degrade 
a quality service compliance of sessions currently existing in the 

5 information system gracefully degrade a quality service compliance of new 
sessions incoming to the information system, gracefully degrade new and 
existing sessions and gracefully degrade lower priority customers as defined 
in the user's quality of service objectives. It will be appreciated that the 
handling of deferred content requests can employ various combinations of 

10 the above and that the present invention is not limited to the preceding 
examples which are given by way of illustration. 

For purposes of these specification, gracefully degrade means 
incrementally relaxing one or more of the user's quality of service 
objectives, including but not limited to response time, probability of 

1 5 response time, consistency of response time, number of concurrent users, 
number of concurrent content requests, and the like, for various customer 
classes and services. 

In one embodiment, the host processing system includes an 
admission control system that normally admits, delays and/or rejects 

20 content requests from a client system to a server. If processing resources of 
the of the information system are strained, the admission control system can 
admit content requests based the user's quality of service objectives and 
priorities. Admission control software operates principally on a server 
within the information system. 

25 Examples of the user's quality of service objectives include but are 

not limited to, speed of content delivery for a specified time, consistency of 
speed of content delivery, information system response time, information 
system response time consistency, the number of concurrent users or 
content requests and the like. 
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Referring now to Figure 1, one embodiment of a method for 
determining the behavior of an information system application is illustrated. 
As illustrated, the information system includes at least one software 
application, a network information delivery system as well as combinations 

5 of software applications with network information delivery systems. 
Examples of software applications include but are not limited to e- 
commerce and other internet based transactions such as browsing, buying, 
requesting specific information or pages, and the like. Each network 
information delivery system has server and network components which may 

10 include but are not limited to one or more of the following as well as 

combinations of a, web-server, application server, data base server, fire wall 
server, secure transaction server, load balancer, web switch, network quality 
of service manager, network bandwidth manager, network traffic shaper, 
cache, content delivery system, and the like. These information system 

1 5 components may reside in one or more geographic locations and connected 
by a variety of different networking configurations and technologies well 
known to those skilled in the art. 

The behavior of the information system application for content 
requests, volumes and mixes, and various load conditions is determined. 

20 Load conditions are the amount of consumed resources of the information 
system at a given point in time. Examples of consumed resources include 
but are not limited to, utilization of computing resources, utilization of 
network bandwidth resources, utilization of database resources, and the like. 
The user's quality of service objectives are then ascertained for various 

25 customer classes and services. The information system's capacity allocation 
is then prioritized to best meet the user's quality of service objectives. On 
an instantaneous and on-going bases, changes are detected in the 
information system application and delivery system behaviors as a result of 
changes in content request patterns, volumes and mixes. In response to 
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detecting changes that effect a committed or guaranteed delivery of the 
user's quality of service objectives, the behavior of the information system 
and/or application is updated periodically or dynamically in order to better 
meet the user's quality of service objectives. 

5 Additionally, the information system application's behavior can 

change due to, software or hardware revisions and changes, different traffic 
volumes, mixes and content requests. Changes in traffic mix can include 
changes in content requests, geography or user types, software and hardware 
revisions or changes. 

10 Detecting changes in the information system application can result 

in a determination of control techniques required by software and/or 
network components of the information system application in order to 
insure that the user's quality of service objectives are met. Examples of 
detecting changes in the information system include determining the amount 

15 of capacity that needs to be reserved in the information system application 
to insure that the user's quality of service objectives are met, updating 
behavior characteristics of the information system based on recent and/or 
historical observations of the information system application, updating 
behavior characteristics of the information system based on recent and 

20 historical observations of the information system application. Examples of 
control techniques include but are not limited to, changing the policies or 
weightings of load balancers, web-switches, and/or network bandwidth 
managers, in order to reallocate patterns of content requests and meet the 
user's quality of service objectives. These control techniques can be 

25 initiated as a result of an observed changes in content request volume or mix 
that potentially impact the user's quality of service objectives. Additionally, 
the control techniques can be initiated as a result of a change in the 
information system infrastructure such as the addition or deletion of 
computing or networking components as well as additions, deletions or 
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revision of software applications, and other changes in the information 
system that can impact the user's quality of service objectives. 

In another embodiment of the present invention, the behavior at least 
one software application coupled to the network information delivery 

5 system is determined for various levels and patterns of user content 
requests. Then, the network information delivery system' s behavior is 
determined as a function of user content requests. The user's quality of 
service objectives are ascertained and a capacity allocation of the network 
information delivery system is prioritized based on the user's quality of 

1 0 service objectives and priorities. Changes in the software application and/or 
network information delivery system are then determined. The behaviors of 
the software application and the network information delivery system are 
updated dynamically in response to detecting changes that effect the user's 
quality of service objectives. 

15 The ability to characterize the software application and information 

system, as a function of various user content request volumes and mixes, 
coupled with the control techniques described above, allows the information 
system to deliver consistent and predictable service levels. These are 
delivered in the midst of unanticipated changes from the infrastructure 

20 and/or the content request side consistent with the user's quality of service 
objectives and priorities.. 

Referring now to Figure 2, after the behavior of an information 
system is determined the software application is then characterized via a 
method of grouping content request's by one or more behavior labels for 

25 each content request. Sessions for various user and service types are 

defined. Sessions are a group of one or more content requests. The sessions 
are then modeled to create representative sessions. Each session is then 
matched with one or more representative sessions. 
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A representative session can be a characterization of a group of 
content requests in terms of common properties. Common properties are 
typically behavior or statistical properties. Examples of behavior properties 
include capacity usage, the time required to process a new content request, 
5 the amount of capacity required to process the new content request and the 
number of database interactions of the new content request the amount of 
response time required to process the new content request. 

New content requests include but are not limited to URL's. Each 
session can be a group of URL's, one or more content requests and one or 
1 0 more content requests from a single requestor. The requestor is either a 
human being or a machine such as another server or computer from the 
same or different information system. 

Each session can be defined by its service type, number and type of 
content requests it contains and content requests requested by a given 
1 5 browser. Content requests are labeled according to the content request's 
behavior characteristics under a specific application, traffic, load and 
information system conditions. 

In another embodiment of the present invention, a method of 
clustering content requests by one or more behavior labels is provided. 
20 Sessions are defined for various user and service types. The sessions are 
modeled to create representative sessions. Each session is then matched 
with one or more representative sessions. 

In another embodiment of the present invention, a method of 
clustering content requests by grouping by one or more behavior labels for 
25 each content request by the user's quality of service objectives is provided. 
Sessions are defined for various user and service types. The sessions are 
once again modeled to create representative sessions and each session is 
then matched with one or more representative sessions. 
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The present invention also provides a method of grouping content 
request's by one or more behaviors. Each content request is labeled by mix. 
Sessions are defined for various user and service types. Again, the sessions 
are modeled to create representative sessions and each session is matched 
5 with one or more representative sessions. 

In another embodiment, content request's are grouped by one or 
more behaviors by labeling each content request by capacity. Sessions are 
then defined as before followed by modeling to create representative 
sessions and matching each session with one or more representative 
10 sessions. 

Referring now to Figure 3, after the application is characterized a 
capacity function and provisioning function of the information system are 
generated. In one embodiment, a method of defining the required 
information delivery system capacity, as a function of the user's service 

1 5 quality objectives, is provided. The information delivery system monitors 
behavior to understand under what conditions the user's service quality 
objectives are met or not met. The conditions in which the user's service 
quality objectives are met or not met are captured. Statistical techniques are 
applied to the conditions captured. Examples of statistical techniques 

20 include but are not limited to regression analysis, Baysian modeling, and the 
like. A model is then created that describes the conditions in which the 
user's service quality objectives are met or not met. 

Creation of the model can take various forms such as analytical 
techniques to minimize an error associated with the model, statistical 

25 techniques to minimize an error associated with the model and probability 
techniques to minimize an error associated with the model. Additionally, 
the model can be used to predict a state of the information system required 
to meet selected user quality of service objectives. 
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A flow chart of the admission control methodology of the present 
invention is illustrated in Figure 4(a). The present invention also provides 
methods to determine when and how to admit a content request to the 
information system. These methods may be predictive, reactive and 
5 combinations of the two. The user's quality of service objectives and a 
content request are received at the information system. A determination is 
made to ascertain if the content request is for an existing new session. The 
content request is sent to a dispatch control function when the content 
request is for an existing session. 
10 After the user's quality of service objectives and new content request 

are received at the information system a determination is made to ascertain 
if the content request is for an existing session or a new session. When the 
content request is not part of an existing session, future content requests 
expected in a predetermined time for the information system are predicted. 
1 5 New content requests expected in the predetermined time can then 

be aggregated with existing content requests currently being processed by 
the information system to create an aggregated content capacity request. 
New content requests and current content requests are then processed to 
determines if the information system can process the aggregated content 
20 capacity request in compliance with the user' s quality of service objectives. 
This processing is based on behavior of the information system for user 
content requests, volumes and mixes, and various load conditions as 
disclosed in Figure 1. The content request is either accepted or rejected. If 
accepted, the content request is then sent to dispatch control as described in 
25 Figures 5(a) and 5(b) hereafter. When the content request is rejected, it is 
processed according to a user defined rejection rule. 

The user defined rejection rule can include, sending a message to the 
requestor, making the information system unavailable for the content 
request, making the information system unavailable for the content request 
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for a selected period of time, queuing the content request for admission to 
the information system, gracefully degrading a quality service compliance 
of sessions currently existing in the information system gracefully 
degrading a quality service compliance of new sessions incoming to the 
5 information system, gracefully degrading new and existing sessions and 
gracefully degrading lower priority customers as defined in the user's 
quality of service objectives. 

Figure 4(b) illustrates a method for determining a number of future 
content requests that will arrive at the information delivery system for a pre- 

10 determined future period of time. A number of models are created that 

predict the number of future content requests. A determination is made for 
each of its respective predictions for the pre-determined future period of 
time. A model is then selected which has the least error, or lowest penalty 
function, associated with its prediction to create a best model predictive 

15 assessment of the next interval's number of content requests. The number 
of current content requests is added to the predicted future content requests 
to create an aggregate total number of content requests. The aggregated 
total number of content requests is then sent to a capacity function to 
determine if at that instant the information system has enough available 

20 capacity to service the aggregated number of content requests in compliance 
with the user's quality of service objectives. 

A penalty function is a method to determine the accuracy of a model 
that predicts the number of content requests. An example of a suitable 
function useful with the present invention is achieved by means of actual 

25 observation of the number of content requests during a selected time period 
which are then compared to the predicted value. This assessment may be 
performed at any instant in time or over a period of time in order to 
construct a selection function which determines the predictive models that 
are the most accurate. Such assessments are dynamic and changed with 
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modifications of the user's quality of service objectives or changes to the 
information system. The selection function can include construction of a 
probability distribution over a set of predictive models. The construction of 
the probability distribution then determines the accuracy of the models and 
5 the stochastic selection of the models according to this distribution. An 
example of a probability distribution is: 



^ ^n{gain-gain } ) 
7=1 

where n is a free parameter called "learning rate" and gain; is an 
accumulated penalty of algorithm i and P[j] is the probability for algorithm 



Prediction(i) = alpha* observation + (1 - alpha)*Prediction(i-l) 
for different values of alpha. 

Referring now to Figure 4(c), an example of a reactive method for 



1 5 requests that the information system can process, without compromising the 
user's quality of service objectives, is illustrated. In this embodiment, a 
quota of maximum sessions that a server can handle while still maintaining 
the user's quality of service objectives is calculated. The quota calculation 
can be achieved by observing a fixed number of content requests and also 

20 be determined by the number of times the user's quality of service 

objectives have been violated divided by the number of content requests. 

A determination is then made to see if the content request exceeds 
the quota. The content request is sent to the server if the quota is not 
exceeded. If the quota is exceeded, and the user's quality of service 

25 objectives are met, the content request is sent to a throughput computation 
to determine whether or not the server can process any more content 




10 j. Additionally, each model can be of the form (recursive): 



producing pre-determined estimates of the volume and mix of content 
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requests. If the quota is exceeded, and the user' s quality of service 
objectives are not met, the content request is rejected, redirected or delayed. 
When the quota is exceeded, and the user's quality of service objectives are 
not met, then the user's quality of service objectives can be downgraded. 
5 The content request is then sent to the throughput function whether or not 
the server can process any more content requests. 

The throughput calculation can be a capacity utilization of the server 
using content request arrival rates. Additionally, the throughput calculation 
can be latencies and a quota of maximum content requests that the server 

10 can handle while still maintaining the user's quality of service objectives. 

Referring now to Figure 4(d), a control method for admitting a 
content request to an information system includes determining if a 
predictive or a reactive control method provides a better fit to the user's 
quality of service objectives. After this determination is made, the result is 

1 5 dispatched and sent to a web-server. Additionally, on-going statistical 
results of the predictive and reactive functions can be maintained to 
determine if the predictive or reactive method is better at any instant in time 
or over a historical period of time. Dynamic switching, via real time 
feedback to the information system, is made between the predictive or 

20 reactive methods based on which one is better is meeting the user quality of 
service objectives. 

It will be appreciated that the present invention is not limited to the 
preceding specific examples of predictive and reactive control methods. 
As illustrated in Figures 5(a) and 5(b), the present invention also provides a 

25 method for admitting content requests to an information system that 

includes a server and a queue of content requests coupled to the server. A 
determination is made to ascertain if the queue is empty. The capacity 
function of the server is checked to see if the server has the capacity to 
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admit the new content request. When the server has capacity, and the 
Queue is empty, the new content request is dispatched to the server. 

Additionally, the throughput capacity function of the server is 
checked to determine if the server can admit the new content request. When 
5 the server has the throughput capacity, the new content request is dispatched 
to the server. 

In another embodiment, when the Queue is empty and the server has 
the capacity to receive new content requests, the new content request is then 
dispatched to the server. In the event the server does not have capacity, then 

10 the new content request remains in the Que. Thereafter, the server's 

capacity is periodically checked.. Alternatively, the server is checked for 
capacity whenever a content request is completed and leaves the server. 

In another embodiment, the new content request is only admitted to 
the Queue when it is empty. Then, the server is checked for capacity when 

1 5 a content request is completed and leaves the server. It yet another 

embodiment, when a content request is completed and leaves the server the 
Queue is checked to see if it is empty. When the Queue is not empty the 
capacity of the server is checked. If the server has capacity then the new 
content request is admitted to the server. If the server doesn't have capacity 

20 the new content request remains in the Que. 

In another embodiment, new content requests are dispatched to the 
information system. The Queue is periodically checked, irregardless of 
whether or not a content request is admitted to or leaves the server. 

Referring now to Figure 6, a provisioning model is created. The 

25 provisioning model prepares the capacity and behavior of the information 
system to accommodate and to meet the user's quality of service objectives. 
A software application is characterized based on its impact on the capacity 
of the information system. Additionally, utilization of the capacity of the 
information system is characterized for software applications, software or 
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hardware revisions and changes, different traffic volumes, mixes and 
content requests. 

A prediction of aggregated incoming and exiting sessions of the 
information system is received. A model is then produced that correlates a 
5 number of sessions to the capacity utilization. Each session is broken down 
into individual content requests. The model is then applied to individual 
content requests to determine if the information system has sufficient 
capacity to process the aggregated content requests while meeting the user's 
quality of service objectives. 

1 0 Referring now to Figure 7, the present invention can also 

dynamically control the information system in order to enforce the user's 
quality of service objectives. In this embodiment, a capacity function is 
checked to see if the information system has sufficient capacity to process 
an incoming content request in compliance with the user's quality of service 

15 objectives. If the information system does not have sufficient capacity at 
that instant, then a real time control signal may be sent to a component 
within the information system. The real time control signal changes the 
component's policy, weighting or behavior in a manner to better meet the 
user's quality of service objectives. Again, the components include but are 

20 not limited to a, web-server, application server, data base server, fire wall 
server, secure transaction server, load balancer, web switch, network quality 
of service manager, network bandwidth manager, network traffic shaper, 
cache, content delivery system, and the like. For purposes of this 
specification, a real time control signal can be, a change in a traffic 

25 allocation rule, a change of a traffic allocation weighting of the information 
system components, a change of bandwidth allocation weighting of the 
information system components, an initiation of server overflow resources, 
and the like. The real time control signals are sent to the information 
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system by a variety of means including but not limited to, a command line 
interface, a application program interface, one or more cookies, and the like. 

If the Queue is not empty the capacity of the server is checked. 
When the server doesn't have capacity the new content remains in the Que. 

5 A determination is then made of actions and commands to send to the 
information system components. These actions and commands place the 
information system components in a more readily suitable state for meeting 
the user's service of quality objectives. Additionally, conditions are 
determined for when the information system components can meet the 

10 user's quality of service objectives. At least one real time control signal is 
sent to the information system components in response to determining the 
conditions in which the information system components can meet the user's 
quality of service objectives. The information system components are 
subsequently placed in a state to meet the user's quality of service 

15 objectives. The behavior of the information system components is modified 
in order to change the capacity allocation of the server. 

The foregoing description of a preferred embodiment of the 
invention has been presented for purposes of illustration and description. It 
is not intended to be exhaustive or to limit the invention to the precise forms 

20 disclosed. Obviously, many modifications and variations will be apparent 
to practitioners skilled in this art. It is intended that the scope of the 
invention be defined by the following claims and their equivalents. 
What is claimed is: 
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CLAIMS 



1 1. A predictive method for admitting a content request to an 

2 information system, comprising: 

3 receiving a user' s quality of service obj ectives at the information 

4 system; 

5 receiving a content request to be admitted to the information system; 

6 determining if the content request is for an existing session or a new 

7 session; and 

8 sending the content request to a dispatch control function at the 

9 information system when the content request is for an existing session. 

1 2. The method of claim 1, wherein the user' s quality of service 

2 objectives include speed of content delivery for a specified time. 

1 3 . The method of claim 1, wherein the user' s quality of service 

2 objectives include consistency of speed of content delivery. 

1 4. The method of claim 1, wherein the user's quality of service 

2 objectives include a function of number of concurrent users. 

1 5. The method of claim 1, wherein the user's quality of service 

2 objectives include system response time. 

1 6. The method of claim 1, wherein the user's quality of service 

2 objectives include system response time consistency. 

1 7. A reactive method for admitting a content request to an 

2 information system with a server, comprising: 
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3 receiving a user' s quality of service objectives at the information 

4 system; 

5 receiving a content request to be admitted to the information system; 

6 calculating a quota of maximum sessions that the server can handle 

7 and maintain the user's quality of service objectives; and 

8 determining if the content request exceeds the quota. 

1 8. The method of claim 7, further comprising: 

2 sending the content request to the server if the quota is not exceeded. 

1 9. The method of claim 7, further comprising: 

2 if the quota is exceeded and the user' s quality of service objectives 

3 are met, then sending the content request to a throughput computation to 

4 determine whether or not the server can process any more content requests. 

1 10. The method of claim 7, further comprising: 

2 if the quota is exceeded and the user's quality of service objectives 

3 are not met, then rejecting the content request. 

1 11. The method of claim 7, further comprising: 

2 if the quota is exceeded and the user's quality of service objectives 

3 are not met, then downgrading the user's quality of service objectives and 

4 sending the content request to a throughput function whether or not the 

5 server can process any more content requests. 

1 12. The method of claim 9, wherein the throughput calculation is 

2 a capacity utilization of the server using content request arrival rates, 

3 latencies and a quota of maximum content requests that a server can handle 

4 and maintain the user's quality of service objectives. 

1 13. The method of claim 7, wherein the quota calculation is done 

2 by observing a fixed number of content requests. 
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1 14. The method of claim 7, wherein the quota calculation is 

2 determined by the number of times the user' s quality of service objectives 

3 have been violated divided by the number of content requests. 

1 1 5. A method for admitting a content request to an information 

2 system, comprising: 

3 receiving a user's quality of service objectives at the information 

4 system; 

5 receiving a content request to be admitted to the information system 

6 from a requestor; 

7 determining if the content request is for an existing session or a new 

8 session; and 

9 if the content request is not part of an existing session then 

1 0 predicting future content requests expected in a predetermined time for the 

-J 1 information system. 

■ = 1 16. The method of claim 15, further comprising: 

= 2 aggregating new content requests expected in the predetermined 

Z 3 time with existing content requests currently being processed by the 

= 4 information system and create an aggregated content capacity request. 

1 17. The method of claim 16, further comprising: 

2 determining if the information system can process the aggregated 

3 content capacity request in compliance with the user' s quality of service 

4 objectives. 

1 18. The method of claim 17, further comprising: 

2 accepting or rejecting the content request. 
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1 19. The method of claim 18, further comprising: 

2 sending the content request to dispatch control if the content request 

3 is accepted. 

1 20. The method of claim 18, further comprising: 

2 sending the content request to a user defined rejection rule if the 

3 content request is rejected. 

1 21. The method of claim 20, wherein the user defined rej ection 

2 rule includes sending a message to the requestor. 

1 22. The method of claim 20, wherein the user defined rej ection 

2 rule includes making the information system unavailable for the content 

3 request. 

1 23 . The method of claim 20, wherein the user defined rejection 

2 rule includes making the information system unavailable for the content 

3 request for a selected period of time. 

1 24. The method of claim 20, wherein the user defined rejection 

2 rule includes queuing the content request for admission to the information 

3 system. 

1 25. The method of claim 20, wherein the user defined rejection 

2 rule includes gracefully degrading a quality service compliance of sessions 

3 currently existing in the information system. 

1 26. The method of claim 20, wherein the user defined rejection 

2 rule includes gracefully degrading a quality service compliance of new 

3 sessions incoming to the information system. 
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1 27. The method of claim 20, wherein the user defined rejection 

2 rule includes gracefully degrading new and existing sessions. 

1 28. The method of claim 20, wherein the user defined rejection 

2 rule includes gracefully degrading lower priority customers as defined in 

3 users' quality of service objectives. 
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METHODS AND APPARATUS TO CONTROL DELIVERY 
OF A HOST INFORMATION SYSTEMS 
SERVICE LEVELS AND RESPONSE TIMES 



ABSTRACT 

A predictive method is provided for admitting a content request to 
an information system. A user's quality of service objectives are received at 
the information system. A content request is received that is to be admitted 
to the information system. A determination is made to see if the content 
request is for an existing session or a new session. The content request is 
then sent to a dispatch control function at the information system when the 
content request is for an existing session. 
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